Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamReader.table()" by PorridgeSwim · Pull Request #56189 · apache/spark

PorridgeSwim · 2026-05-28T18:52:20Z

What changes were proposed in this pull request?

This reverts commit 05b4d81f3f938ff140886d6f66ad66d08c66d5b2 (SPARK-56975), which made DataStreamReader.table() reject a user-specified schema by calling assertNoSpecifiedSchema("table"). This restores the previous behavior, where a user-specified schema passed before .table() is accepted (and ignored).

Why are the changes needed?

SPARK-56975 is a behavior-breaking change. Code that previously ran successfully — e.g. spark.readStream.schema(s).table(name) — now throws an AnalysisException (_LEGACY_ERROR_TEMP_1189). While a schema has no effect on .table(), rejecting it outright breaks existing user workloads that set a schema on the DataStreamReader before calling .table().

A user-facing behavior change like this must go through the project's breaking-change process, which was not followed for SPARK-56975. We are reverting it to restore backward compatibility; a proper deprecation path can be pursued separately if the stricter behavior is still desired.

Does this PR introduce any user-facing change?

Yes. It restores the pre-SPARK-56975 behavior: DataStreamReader.table() again accepts (and silently ignores) a user-specified schema instead of throwing AnalysisException (_LEGACY_ERROR_TEMP_1189). Since SPARK-56975 only landed in unreleased branches (master and branch-4.2), there is no change relative to any released Spark version.

How was this patch tested?

This is a straight git revert. Existing DataStreamTableAPISuite tests pass; the test added by SPARK-56975 ("read: user-specified schema is not allowed with table API") is removed as part of the revert.

Was this patch authored or co-authored using generative AI tooling?

No.

…eader.table()" This reverts commit 05b4d81.

…eader.table()" ### What changes were proposed in this pull request? This reverts commit `05b4d81f3f938ff140886d6f66ad66d08c66d5b2` (SPARK-56975), which made `DataStreamReader.table()` reject a user-specified schema by calling `assertNoSpecifiedSchema("table")`. This restores the previous behavior, where a user-specified schema passed before `.table()` is accepted (and ignored). ### Why are the changes needed? SPARK-56975 is a behavior-breaking change. Code that previously ran successfully — e.g. `spark.readStream.schema(s).table(name)` — now throws an `AnalysisException` (`_LEGACY_ERROR_TEMP_1189`). While a schema has no effect on `.table()`, rejecting it outright breaks existing user workloads that set a schema on the `DataStreamReader` before calling `.table()`. A user-facing behavior change like this must go through the project's breaking-change process, which was not followed for SPARK-56975. We are reverting it to restore backward compatibility; a proper deprecation path can be pursued separately if the stricter behavior is still desired. ### Does this PR introduce _any_ user-facing change? Yes. It restores the pre-SPARK-56975 behavior: `DataStreamReader.table()` again accepts (and silently ignores) a user-specified schema instead of throwing `AnalysisException` (`_LEGACY_ERROR_TEMP_1189`). Since SPARK-56975 only landed in unreleased branches (`master` and `branch-4.2`), there is no change relative to any released Spark version. ### How was this patch tested? This is a straight `git revert`. Existing `DataStreamTableAPISuite` tests pass; the test added by SPARK-56975 (`"read: user-specified schema is not allowed with table API"`) is removed as part of the revert. ### Was this patch authored or co-authored using generative AI tooling? No. Closes #56189 from PorridgeSwim/revert-SPARK-56975. Lead-authored-by: You Zhou <you.zhou@databricks.com> Co-authored-by: You Zhou <98635051+PorridgeSwim@users.noreply.github.com> Signed-off-by: Anish Shrigondekar <anish.shrigondekar@databricks.com> (cherry picked from commit 6039af8) Signed-off-by: Anish Shrigondekar <anish.shrigondekar@databricks.com>

PorridgeSwim changed the title ~~Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamR…~~ Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamReader.table()" May 28, 2026

anishshri-db approved these changes May 29, 2026

View reviewed changes

Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamR…

6170a60

…eader.table()" This reverts commit 05b4d81.

PorridgeSwim force-pushed the revert-SPARK-56975 branch from e4042ab to 6170a60 Compare May 29, 2026 18:19

Merge branch 'apache:master' into revert-SPARK-56975

b0b3d0d

anishshri-db closed this in 6039af8 May 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamReader.table()"#56189

Revert "[SPARK-56975][SS] Reject user-specified schema in DataStreamReader.table()"#56189
PorridgeSwim wants to merge 2 commits into
apache:masterfrom
PorridgeSwim:revert-SPARK-56975

PorridgeSwim commented May 28, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

PorridgeSwim commented May 28, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

PorridgeSwim commented May 28, 2026 •

edited

Loading